Knowledge-Based WSD on Specific Domains: Performing Better than Generic Supervised WSD
نویسندگان
چکیده
This paper explores the application of knowledgebased Word Sense Disambiguation systems to specific domains, based on our state-of-the-art graphbased WSD system that uses the information in WordNet. Evaluation was performed over a publicly available domain-specific dataset of 41 words related to Sports and Finance, comprising examples drawn from three corpora: one balanced corpus (BNC), and two domain-specific corpora (news related to Sports and Finance). The results show that in all three corpora our knowledge-based WSD algorithm improves over previous results, and also over two state-of-the-art supervised WSD systems trained on SemCor, the largest publicly available annotated corpus. We also show that using related words as context, instead of the actual occurrence contexts, yields better results on the domain datasets, but not on the general one. Interestingly, the results are higher for domain-specific corpus than for the general corpus, raising prospects for improving current WSD systems when applied to specific domains.
منابع مشابه
Knowledge-Based WSD and Specific Domains: Performing Better than Generic Supervised WSD
This paper explores the application of knowledgebased Word Sense Disambiguation systems to specific domains, based on our state-of-the-art graphbased WSD system that uses the information in WordNet. Evaluation was performed over a publicly available domain-specific dataset of 41 words related to Sports and Finance, comprising examples drawn from three corpora: one balanced corpus (BNC), and two...
متن کاملAll Words Domain Adapted WSD: Finding a Middle Ground between Supervision and Unsupervision
In spite of decades of research on word sense disambiguation (WSD), all-words general purpose WSD has remained a distant goal. Many supervised WSD systems have been built, but the effort of creating the training corpus annotated sense marked corpora has always been a matter of concern. Therefore, attempts have been made to develop unsupervised and knowledge based techniques for WSD which do not...
متن کاملWord Sense Disambiguation using Association Rules: A Review
Now days, Word Sense Disambiguation (WSD) is a vital area which is very useful in today’s world. Many WSD algorithms are available in literature; we have chosen to an optimal and portable WSD algorithm. We are discussed the supervised, unsupervised, and knowledge-based approaches for WSD. In this paper we are discuses that association rules, Knowledge-based WSD, Corpus-based WSD.
متن کاملA New Minimally-Supervised Framework for Domain Word Sense Disambiguation
We present a new minimally-supervised framework for performing domain-driven Word Sense Disambiguation (WSD). Glossaries for several domains are iteratively acquired from the Web by means of a bootstrapping technique. The acquired glosses are then used as the sense inventory for fullyunsupervised domain WSD. Our experiments, on new and gold-standard datasets, show that our wide-coverage framewo...
متن کاملSemi-Supervised WSD in Selectional Preferences with Semantic Redundancy
This paper proposes a semi-supervised approach for WSD in Word-Class based selectional preferences. The approach exploits syntagmatic and paradigmatic semantic redundancy in the semantic system and uses association computation and minimum description length for the task of WSD. Experiments on Predicate-Object collocations and Subject-Predicate collocations with polysemous predicates in Chinese ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009